
[opt] adapt cache blend for store and sparse's new version#664

Merged
mag1c-h merged 4 commits into ModelEngine-Group:develop from wuhuxiao:dev_gsa_device_pr_whx
Jan 23, 2026
Conversation


wuhuxiao (Contributor) commented on Jan 22, 2026

Purpose

What this PR does / why we need it?

Fixes the cache blend functionality and adapts it to the new versions of the store and sparse components.

Modifications

Does this PR introduce any user-facing change?

Adds NVTX profiling ranges for the blend path.
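The NVTX annotation mentioned above can be sketched as follows. This is a minimal illustration, not the PR's actual code: it assumes PyTorch's `torch.cuda.nvtx` bindings, and the range name `cache_blend`, the `nvtx_range` helper, and the no-op fallback stub are all hypothetical.

```python
from contextlib import contextmanager

try:
    from torch.cuda import nvtx  # NVTX bindings shipped with PyTorch
except ImportError:
    # Hypothetical no-op stand-in so the sketch runs without torch installed.
    class nvtx:
        @staticmethod
        def range_push(name):
            pass

        @staticmethod
        def range_pop():
            pass


@contextmanager
def nvtx_range(name):
    """Mark a region so it appears as a named range in Nsight Systems."""
    nvtx.range_push(name)
    try:
        yield
    finally:
        nvtx.range_pop()


# Hypothetical usage around a cache-blend step:
with nvtx_range("cache_blend"):
    blended = [a + b for a, b in zip([1, 2], [3, 4])]
```

Wrapping the region in a context manager guarantees `range_pop` runs even if the blend step raises, keeping the profiler's push/pop stack balanced.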

Test

How was this patch tested?

```shell
DATA_DIR=/home/data/kv_cache
MODEL_PATH=/home/models/mistralai/Mistral-7B-Instruct-v0.2
BLEND_DATASET_PATH=/home/datasets/LongBench/data/2wikimqa.jsonl
cd unified-cache-management
python examples/offline_inference_blend.py
```

wuhuxiao force-pushed the dev_gsa_device_pr_whx branch from 3b067b6 to 5e7110c on January 23, 2026 08:27
mag1c-h merged commit 6000d75 into ModelEngine-Group:develop on Jan 23, 2026
6 checks passed


3 participants